Porting Decipher from English to Mandarin
Authors
Abstract
This paper describes our efforts in porting the SRI Decipher English system to Mandarin for transcribing telephone conversations. The work covers all aspects of the system: the pronunciation phone set and lexicon, word segmentation, pitch features, discriminatively trained acoustic models with parameter sharing determined by decision trees, and web-data-augmented language models.
Similar resources
Porting the Galaxy System to Mandarin Chinese
Galaxy is a human-computer conversational system that provides a spoken language interface for accessing on-line information. It was initially implemented for English in travel-related domains, including air travel, local city navigation, and weather. Efforts to develop multilingual systems within the Galaxy framework began several years ago. This thesis focuses on developing the Mand...
Automatic evaluation and training in English pronunciation
SRI is developing a system that uses real-time speech recognition to diagnose, evaluate and provide training in spoken English. The paper first describes the methods and results of a study of the feasibility of automatically grading the performance of Japanese students when reading English aloud. Utterances recorded from Japanese speakers were independently rated by expert listeners. Speech g...
Enhancing Automatic Acquisition of the Thematic Structure in a Large-Scale Lexicon for Mandarin Chinese
This paper describes a refinement to our procedure for porting lexical conceptual structure into new languages. Specifically, we describe a two-step process for creating candidate thematic grids for Mandarin Chinese verbs, using the English verb heading the VP in the subdefinitions to separate senses, and roughly parsing the verb complement structure to match to our thematic structure templates. Th...
YINHE: a Mandarin Chinese version of the GALAXY system
The GALAXY system is a human-computer conversational system providing a spoken language interface for accessing on-line information. It was initially implemented for English in travel-related domains, including air travel, local city navigation, and weather. We began an effort to develop multilingual systems within the framework of GALAXY several years ago. This paper describes our recent work ...
Acoustic features of vowel production in Mandarin speakers of English
English vowel productions were acoustically examined in a group of native Mandarin speakers. The first and second formant frequencies (F1 & F2) of 11 English vowels were examined in the syllable-level productions of 40 Mandarin speakers compared to 40 American English speakers. Results of the comparative acoustic analysis indicated that the Mandarin speakers differed significantly from the Amer...